Skip to content

[Mellanox] Add CPU thermal control for Nvidia platforms#10202

Merged
sujinmkang merged 2 commits intosonic-net:masterfrom
Junchao-Mellanox:4800-thermal
Mar 21, 2022
Merged

[Mellanox] Add CPU thermal control for Nvidia platforms#10202
sujinmkang merged 2 commits intosonic-net:masterfrom
Junchao-Mellanox:4800-thermal

Conversation

@Junchao-Mellanox
Copy link
Copy Markdown
Collaborator

Why I did it

Add CPU thermal control for Nvidia platforms which will be enabled for platforms that have heavy CPU load. Now it is only enabled on 4800, and it will be enabled on future platforms.

How I did it

Check CPU pack temperature and update cooling level accordingly

How to verify it

  1. Manual test
  2. Added sonic-mgmt test case, PR link will update later

Which release branch to backport (provide reason below if selected)

  • 201811
  • 201911
  • 202006
  • 202012
  • 202106
  • 202111

Description for the changelog

Link to config_db schema for YANG module changes

A picture of a cute animal (not mandatory but encouraged)

keboliu
keboliu previously approved these changes Mar 11, 2022
@keboliu
Copy link
Copy Markdown
Collaborator

keboliu commented Mar 15, 2022

@sujinmkang would you please help to review?

@sujinmkang sujinmkang dismissed liat-grozovik’s stale review March 21, 2022 16:54

change request looks addressed all.

@sujinmkang sujinmkang merged commit f0ddd10 into sonic-net:master Mar 21, 2022
liat-grozovik pushed a commit to sonic-net/sonic-mgmt that referenced this pull request Mar 24, 2022
…#5309)

Add test case for PR sonic-net/sonic-buildimage#10202

- How did you do it?
Mock changing CPU temperature and verify cooling level is updated by algorithm accordingly

- How did you verify/test it?
Run the new test and passed.

- Any platform specific information?
Mellanox/Nvidia SN4800 platform
xwjiang-ms pushed a commit to xwjiang-ms/sonic-mgmt that referenced this pull request Apr 13, 2022
…sonic-net#5309)

Add test case for PR sonic-net/sonic-buildimage#10202

- How did you do it?
Mock changing CPU temperature and verify cooling level is updated by algorithm accordingly

- How did you verify/test it?
Run the new test and passed.

- Any platform specific information?
Mellanox/Nvidia SN4800 platform
@Junchao-Mellanox Junchao-Mellanox deleted the 4800-thermal branch June 9, 2023 07:40
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants